Voice Activity Detection based on Inverse Normalized Noise Likelihood Estimation

نویسندگان

  • Tomas Dekens
  • Mike Demol
  • Werner Verhelst
  • Frédéric Beaugendre
چکیده

In this paper we develop a voice activity detection algorithm based on the likelihood that only noise is present in the current signal frame. For this we exploit the fact that the Fourier coefficients of most noise processes can be modeled as statistically independent Gaussian random variables. We also give an overview of different voice activity detectors previously described in the literature and compare their results to the ones obtained with the voice activity detector we propose in this paper. According to our tests, at high speech detection probabilities, the proposed algorithm shows results than are comparable to or better than the other voice activity detectors we consider, while the simplicity of the algorithm ensures low computational complexity. Key Words—Noise estimation, Speech enhancement, Voice activity detection.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)

Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...

متن کامل

Spectral subtraction with full-wave rectification and likelihood controlled instantaneous noise estimation for robust speech recognition

In standard Spectral Subtraction (SS), Half-Wave Rectification SS (HWR-SS) is normally applied to avoid negative values in the Power Spectral Density (PSD) that occur mainly due to inaccurate noise estimation caused by a Voice Activity Detector (VAD). In this paper analyses show that, given accurate noise estimation, the phase relationship between speech and noise becomes the dominant cause of ...

متن کامل

Entropy based voice activity detection in very noisy conditions

This paper addresses the problem of robust voice activity detection (VAD) capable for working at very low signal-to-noise ratios (SNR<10dB). A new algorithm that we propose is based on entropy estimation measures of the time-frequency magnitude spectrum. The problem of the estimation of the distribution of noise in detected non-speech segments of analysed signal is also presented. It is shown t...

متن کامل

Voice Activity Detection Using Laplacian Model and UMP Test

This paper presents a new voice activity detection (VAD) method using the Laplacian distribution and a uniformly most powerful (UMP) test. The UMP test is employed to derive the new decision rule based on likelihood ratio test (LRT). The proposed method provide the decision rule by comparing the sum of magnitude of real and imaginary parts of the noisy spectral component to the adaptive thresho...

متن کامل

Noise-robust hands-free voice activity detection with adaptive zero crossing detection using talker direction estimation

This paper proposes a novel hands-free voice activity detection (VAD) method utilizing not only temporal features but also spatial features, called adaptive zero crossing detection (AZCD), that uses talker direction estimation. It firstly estimates talker direction to extract two spatial features: spatial reliability and spatial variance, based on weighted cross-power spectrum phase analysis an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007